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effects. The size of the fluctuations arising in the game is controlled by the 



ners (anticrowds) . The theoretical results are in good agreement with the 
numerical simulations over the entire parameter range of interest. 
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The study of agent-based models of complex adaptive systems is attracting much at- 
tention [|T|. Among the many possible interdisciplinary applications is the growing field of 
econophysics || : each agent knows the past ups and downs in the index of a financial market 
and must decide how to trade based on this global information. The Minority Game (MG) 
introduced by Challet and Zhang offers arguably the simplest paradigm for such a 

complex, adaptive system. The MG comprises an odd number N of agents, each equipped 
with s strategies and a memory size m, who repeatedly compete to be in the minority |]3| [ToU . 
The most striking feature arising from numerical simulations is the non-monotonic variation 
in the size of the fluctuations (i.e. standard deviation) produced by the MG as m is varied 
0]. Challet et al provided a sophisticated formal connection between the MG and spin glass 
systems || which offers many fascinating quantitative insights into the MG's dynamics. 
Given the complexity of this dynamics, it is understandable that no general theory has yet 
been proposed which yields quantitative agreement with the numerical results of Ref. 0] 
over the full range of m, s and N values. 

In this paper, we show that a theoretical model can be constructed in a surprisingly 
simple way by incorporating the 'crowd' effects (i.e. strong inter-agent correlations) which 
arise within the interacting, many-agent population. The results yield good agreement with 
the numerical results [§J over the entire range of m, s and N. The non-monotonic behaviour 
of the standard deviation H[| is shown to arise from a fascinating interplay between a crowd 
and its anti-correlated partner ('anticrowd'). 

The MG comprises an odd number of agents iV who choose repeatedly between option 
(e.g. buy) and option 1 (e.g. sell). The winners are those in the minority group, e.g. sellers 
win if there is an excess of buyers. The outcome at each timestep represents the winning 
decision, or 1. A common bit-string of the m most recent outcomes is made available 
to the agents at each timestep. The agents randomly pick s strategies at the beginning of 
the game, with repetitions allowed. After each turn, the agent assigns one (virtual) point 
to each of his strategies which would have predicted the correct outcome. At each turn of 
the game, the agent uses the most successful strategy, i.e. the one with the most virtual 
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points, among his s strategies. The strategy-space V m forms a 2 m -dimensional hypercube 
with strategies at the 2 2 " 1 vertices. Fortunately, the MG's standard deviation is essentially 
unchanged if a 'reduced' strategy space U m is used instead of V m 0: the U m only contains 
2 m+1 strategies or equivalently 2 m strategy pairs {Q}- The two strategies within a given 
pair Q are anticorrelated, i.e. they differ by the maximum possible Hamming distance 
d H = 2 m [[§. Strategies between any two pairs Q and Q' are uncorrelated, i.e. they differ by 
dn = 2 m_1 . The results presented in this paper employ the reduced strategy space. 

If tir agents use the same strategy R, then they will act as a 'crowd', i.e. they will make 
the same decision. If n R agents simultaneously use the anticorrelated strategy R, they will 
make the opposite decision and will hence act as an 'anticrowd' (Q = (R, R)). If Ur « n R 
for all Q, then the actions of the crowds and anticrowds cancel and the standard deviation 
a of the number of agents choosing a given option (the so-called 'attendance' time-series 
A(t)) will be small. In contrast if n R ^> n R for all Q, then a will be large. Since there is no 
correlation between Q and Q', each group Q comprising a crowd-anticrowd pair (n^,n^) will 
contribute to the attendance A(t) via a separate random walk in time of step-size \tir — tir\. 
The variances of these walks can then be summed to obtain the standard deviation of A(t). 
Hence we describe the MG standard deviation using the following theoretical expression: 
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where both time-averaging, for a given configuration of initial strategies, and configuration- 
averaging have been carried out. We now demonstrate that this crowd-anticrowd cancella- 
tion underlies the numerical results for a vs. m [3J. We run the numerical simulation of 
the MG and wait until transients in A(t) have disappeared. At timestep to, we read out the 
number of players playing each strategy R, where R — 1,2,... 2' m+1 . For each strategy pair 
Q = (R, R), we calculate riR — n R at time t and hence obtain a. We then average this a over 
1000 timesteps to simulate the time-averaging. We have checked that our results are insensi- 
tive to the precise time-averaging procedure. Finally, we average over 16 runs to simulate the 
configuration-averaging. Figure 1 compares the resulting time and configuration-averaged a 

3 



with that obtained from the numerical MG simulation. The agreement is very good for all 
m, s and N (not shown). We conclude that the crowd-anticrowd cancellation can indeed 
quantitatively explain the numerical results of Ref . [|J . 

We need expressions for the number of agents using each strategy, i.e. {ur}. Since 
the labels R are arbitrary in Eq. (1), the ordering of strategies {ur} has no particular 
significance. At any particular time to, these 2 m+1 strategies can be ranked according to 
their virtual points by a sort-operation acting on the list {ur}. Hence {ur} h- > {n p } 
where p is the virtual-point rank label with p = 1 being the highest scoring strategy and 
{n p } = n p= i,n p= 2, ■ ■ .n p=2 ™+i. Another useful method of ordering is achieved by ranking 
strategies {n R } at time t according to their popularity. In particular, strategy r = 1 is 
defined as the strategy which is being used by the largest number of agents, strategy r = 2 
is being used by the second largest number of agents, etc. We denote the popularity ranking 
by {n r } where {n r } = n r= \,n r =2, . . .n r= 2m+i. Note that {n r } can be obtained from {%} 
and hence {n p } by sort operations, i.e. {ur} i— > {n r j and hence {n p } \— > {n r }. Each agent 
plays the available strategy having highest virtual points; this allows an analytic expression 
to be obtained for the probability that an agent plays a given strategy for general s, which 
in turn yields 



where J2r=i n r = N as required. Since agents are discrete objects, the simulation tends 
to produce discrete steps in the curves of n r as a function of r. This effect becomes more 
pronounced as m increases since the total number of strategies 2 m+1 then exceeds the pop- 
ulation size N. We therefore convert the theoretical n r values of Eq. (2) to an integer. For 
large m such that 2 m+1 ^> N, the resulting theoretical values are typically n r ~ 1 for small 
r and n r = for r > N. Figure 2 compares the theoretical values of n r calculated using 
Eq. (2) with s = 2 and iV = 101, to numerical values taken from the MG simulation. The 
agreement is good. 

For the virtual-point ordered list {n p }, the strategy p' = 2 m+1 + 1 — p is always anticor- 




(2) 
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related to the strategy p, i.e. p' = p. Hence knowledge of the sort operation T completely 
determines where each strategy's anticorrelated partner is located in the popularity-ordered 
list {n r }. Since we are here only interested in time-averaged and run-averaged a, we only 
need to consider the probability distribution of locations of f in the popularity-ordered list 
{n r }. We therefore replace the sort operation T by a probability function P(r' = f) which 
gives the probability that any strategy r' is the anti-correlated partner of strategy r in the 
list {n r }. Hence Eq. (1) becomes 

2 m + 1 2 m+1 i -, I 

a= -E E K-tvl^^f) 2 (3) 

LZ r=l r'=l ^ J 

where the factor \ discounts double-counting. There are two limiting cases. When the 
virtual-point ordered list {n p } and the popularity-ordered list {n r } are identical, then P{r' = 
f) will be a ^-function at r' = 2 m+1 + 1 — r and hence Eq. (3) has the same form as Eq. (1). 
In the opposite case where the two ordered lists are uncorrelated, P(r' = f) should be a flat 
distribution. In each case, Eqs. (2) and (3) can be combined to obtain closed-form analytic 
solutions for arbitrary s and N. 

Figure 3 shows P(r' = f) for r = 1 as a function of r', taken from the numerical MG 
simulation at m = 2, 5 and 10. For small m (m — 2) the anticorrelated strategy to the most 
popular strategy (i.e. r = 1) is at r' = 2 m+1 , i.e. it is the least popular strategy. Hence 
P{r' = f) resembles the 5-function limiting case mentioned above. From Fig. 2 we know that 
very few agents will therefore pick this anticorrelated strategy. Hence the crowd-anticrowd 
cancellation will be small and a will be large, as can be seen in Fig. 1. As m increases 
(m = 5) a remarkable effect occurs: the peak in P(r' = f) moves up toward r — 1. Hence 
both r = 1 and its anticorrelated partner f are now very popular. Whereas for m = 2 it 
seemed like there was an effective 'repulsion' between r and f, for m = 5 this now seems 
more like an attraction. Amusingly, the shape of P(r' = f) for m = 5 is reminiscent of the 
screening effect of a negative charge cloud around a positive charge placed at r = 1, or even 
a bound electron-hole pair (i.e. exciton) with the crowd (anticrowd) playing the role of the 
positive (negative) charge. For large m (m — 10), the ability of the anticrowd to 'screen' 



the crowd has decreased yielding a rather flat distribution as shown. The consequence of 
this strong crowd- ant icrowd correlation which appears as m increases, is that the crowd and 
anticrowd become comparable in size. Hence a is small for m ~ 5 — 6, in agreement with 
Fig. 1. Note that the MG cannot fully 'optimize' itself by building equal-sized crowds and 



anticrowds. In modified models of MG [EC where this in-built frustration in the strategy 
space is allowed to relax, equal-sized crowds and anti-crowds do naturally emerge. 

We now consider analytic expressions for a for general s using Eqs. (2) and (3). For 
small m, the virtual-point ordered list {n p } and the popularity-ordered list {n r } will be very 
similar, hence P(r' = f) ~ 5 r r 2 m+i + i_ r . The discreteness of the agents will be unimportant 
since n r 3> 1, hence n r can be treated as continuous. Equations (2) and (3) yield 
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For s = 2 this becomes 
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Figure 4 shows these analytic curves for s = 2 and s = 4 (solid lines monotonically decreas- 
ing). As might be expected using the extreme ^-function form for P(r' = f), these curves are 
slightly higher than the numerical results in Fig. 1 for small m. Now consider the opposite 
extreme of uncorrelated r' and f, i.e. the flat distribution P(r' — f) ~ 2~( m+1 \ For s = 2 
this gives 

N 
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(6) 



Equation (6) typically produces lower estimates for each s value (dashed lines). Values of 
a obtained from separate numerical runs tend to be scattered in the region of these curves. 
For larger m (m > 6) we cannot ignore the discreteness of the agents (Fig. 2). In this 
regime, n r ~ 1 for r < while n r = for r > N. Using the integer form of Eq. (2) for high 
m, and the flat distribution for P{r' = f), yields 
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This expression approaches the coin-toss limit from below as m — > oo, as shown in Fig. 
4 (solid line monotonically increasing). Within the approximation used here, this curve is 
insensitive to s. Note that the numerical MG results are also consistent with this finding 
of weak s-dependence for large m. Hence Fig. 4 gives a clear picture of what happens to 
a as s and m increase: considering the monotonically decreasing curve for low m, and the 
monotonically increasing curve for high m, we see that (a) there should be a minimum in a 
for s = 2, (b) this minimum should move to higher mass increases and (c) the minimum 
should become shallower as s increases. Each of these statements agrees with numerical 
simulation results (c.f. Fig. 1). In addition, the curve (Thigh m — * at N = 2 m+1 , i.e. for 
m ~ 5 — 6. 

We now compare the theoretical crowd- ant icrowd calculation (Eq. (3)) with the numer- 
ical simulation. The most interesting case is s = 2. Figure 5 shows the spread of numerical 
values for different runs (open circles) compared to theory (solid circles). The agreement is 
good. The appropriate analytic expressions for the probability function P(r' = f) in Eq. (3) 
involve multiple sums and are complicated: we therefore obtained the results for each m in 
Fig. 5 by generating the corresponding P(r' = f) forms to those in Fig. 3. The theoretical 
points tend to lie in between the limiting curves of Fig. 4 except at the minimum where 
the remarkable form for P(r' = f) (recall Fig. 3) pulls the theoretical value slightly below 
the cruder limiting curves of Fig. 4. Elsewhere we will discuss simpler closed-form expres- 
sions for P(r' = f), together with an analysis of the time-series A(t) and history bit-string 
statistics within the crowd- ant icrowd model . 



In summary we presented an analytic analysis of crowding effects in MG which offers a 
novel explanation of the main finding of Ref. j|. Future work will explore possible connec- 
tions to the spin-glass formalism of Challet et al 0. 

We thank T.S. Lo, J. P. Garrahan, D. Sherrington and D. Challet for discussions. 
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FIGURES 

FIG. 1. Standard deviation a for the Minority Game as a function of memory-size m for 
s = 2,3,4 strategies per agent and N = 101 agents. Solid curve: numerical simulation. Dashed 
curve: crowd-anticrowd theory using Eq. (1). Random (coin-toss) limit a = y/N/2 = 5.0 is 
indicated. 

FIG. 2. Histograms of the number of agents using strategy r for r = 1 (most popular) to 
r = 2 m+1 (least popular). Results are shown for m = 2 (left scale) and m = 7 (right scale). Solid 
lines: numerical simulation. Dashed lines: theory from Eq. (2). 

FIG. 3. Probability function P{r' = f) giving the probability that the strategy ranked r' on 
the popularity-ordered list, is anti-correlated with the strategy ranked r. Results are shown for 
r = 1 (i.e. most popular strategy) as a function of r' for m = 2 (dotted-dashed), m = 5 (dotted) 
and m = 10 (solid), s = 2 and N = 101. Note that £ r , P(r' = r) = 1. 

FIG. 4. Theoretical curves for a using Eqs. (4)-(7). Monotonically decreasing curves for s = 2, 4 
at low m (m < 6): solid lines correspond to (5-function P(r' = f) distribution neglecting agent dis- 
creteness, dashed lines correspond to flat distribution neglecting agent discreteness. Monotonically 
increasing solid line for large m (m > 6) is independent of s: it corresponds to a flat distribution 
and accounts for agent discreteness. 

FIG. 5. Theoretical crowd-anticrowd calculation (solid circles) vs. numerical simulations (open 
circles) for s = 2, N = 101. 16 numerical runs are shown for each m. 
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